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BACKGROUND 



3. Field of the Invention 

Aspects of the present invention relate to the field of e-commerce. Other aspects 
25 of the present invention relate to a method and system to intelligently manage an 
infrastructure that supports an e-service business. 
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4. General Background and Related Art 

The expanding use of the World-Wide Web (WWW) for business continues to 
accelerate and virtual corporations are becoming more commonplace. Many new 
businesses, born in this Internet Age, do not employ traditional concepts of physical site 
5 location (bricks and mortar), on-hand inventories and direct customer contact. Many 
traditional businesses, who want to survive the Internet revolution are rapidly reorganizing 
(or re-inventing) themselves into web-centric enterprises. In today's high-speed Business- 
to-Business (B2B) and Business-to-Customer (B2C) eBusiness environment, a corporation 
must provide high quality service, scale to accommodate exploding demand and be 
10 flexible enough to rapidly respond to market changes. 

The growth of eBusiness is being driven by fundamental economic changes. Firms 
that harness the Internet as the backbone of their business are enjoying tremendous market 
share gains - mostly at the expense of the unenlightened that remain true to yesterday's 
business models. Whether it is rapid expansion into new markets, driving down cost 
15 structures, or beating competitors to market, there are fundamental advantages to 
eBusiness that cannot be replicated in the "brick and mortar" world. 

This fundamental economic shift, driven by the tremendous opportunity to capture 
new markets and expand existing market share, is not without great risks. If a customer 
cannot buy goods and services quickly, cleanly, and confidently from one supplier, a 
20 simple search will divulge a host of other companies providing the same goods and 
services. Competition is always a click away. 

eBusinesses are rapidly stretching their enterprises across the globe, connecting 
new products to new marketplaces and new ways of doing business. These emerging 
eMarketplaces fuse suppliers, partners and consumers as well as infrastructure and 
25 application outsourcers into a powerful but often intangible Virtual Enterprise. The 
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infrastructure supporting the new breed of virtual corporations has become exponentially 
more complex - and, in ways unforeseen just a short while ago, unmanageable by even the 
most advanced of today's tools. The dynamic and shifting nature of complex business 
relationships and dependencies is not only particularly difficult to understand (and, hence 
5 manage) but even a partial outage among just a handful of dependencies can be 
catastrophic to an eBusiness' survival. 

Businesses are racing to deploy Internet enabled services in order to gain 
competitive advantage and realize the many benefits of eBusiness. For an eBusiness, 
time-to-value is so critical that often these business services are brought on-line without 

10 the ability to manage or sustain the service. eBusinesses have been ravaged with 
catastrophe after catastrophe. Adequate technology, to effectively prevent these 
catastrophes, does not exist. 

eBusiness infrastructures operate around the clock, around the globe, and constantly 
evolving. If a critical supplier in Asia cannot process an electronic order due to 

15 infrastructure problems, the entire supply chain may come to a grinding halt. Who 
understands the relationships between technology and business processes and between 
producer and supplier? Are they available 24 hours a day, 7 days a week, 365 a year? 
How long will it take to find the right person and rectify the problem? The promise of 
B2B, B2C and eCommerce in general will not be fully realized until technology is viewed 

20 in light of business process to solve these problems. 

Web-enabled eBusiness processes effectively distill all computing resources down 
to a single customer-visible service (or eService). For example, a user interacts with a 
web site to make an on-line purchase. All of the back-end hardware and software 
components supporting this service are hidden, so the user's perception of the entire 
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organization is based on this single point of interaction. How can organizations mitigate 
these risks and gain the benefits of well-managed eServices? 

Never before has an organization been so dependent on a single point of service 
delivery - the eService. An organization's reputation and brand depend on the quality of 
5 eService delivery because, to the outside world, the eService is the organization. If service 
delivery is unreliable, the organization is perceived as unreliable. If the eService is slow 
or unresponsive, the company is perceived as being slow or unresponsive. If the Service is 
down, the organization might as well be out of business. 

Further complicating matters, more and more corporations are outsourcing all or 

10 part of their web-based business portals. While reducing capital and personnel costs and 
increasing scalability and flexibility, this makes Application Service Providers (ASPs), 
Internet Service Providers (ISPs) and Managed Service Providers (MSPs) the custodians 
of a corporation's business. These "xSPs" face similar challenges - delivering quality 
service in a rapid, cost efficient manner with the added complication of doing so across a 

1 5 broad array of clients. Their ability to meet Service Level Agreements (SLAs) is crucial 
to the eBusiness developing a respected, high quality electronic brand — the equivalent of 
prime storefront property in a traditional brick and mortar business. 

The Internet enables companies to outsource those areas in which the company 
does not specialize. This collaboration strategy creates a loss of control over infrastructure 

20 and business processes between companies comprising the complete value chain. Partners, 
including suppliers and service providers must work in concert to provide a high quality 
service. But how does a company control infrastructure which it doesn't own and 
processes that transcend its' organizational boundaries? Even infrastructure outsourcers 
don't have mature tools or the capability to manage across organizational boundaries. 
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The underlying problem is not lack of resources, but the misguided attempt to 
apply yesterday's management technology to today's eService problem. As noted by 
Forrester Research, "Most companies use 'systems' management tools to solve pressing 
operational problems. None of these tools can directly map a system or service failure to 
5 business impact." To compensate, they rely on slow, manual deployment by expensive 
and hard-to-find technical personnel to diagnose the impact of infrastructure failures on 
service delivery (or, conversely, to explain service failures in terms of events in the 
underlying infrastructure). The result is very long time-to-value and an unresponsive 
support infrastructure. In an extremely competitive marketplace, the resulting service 
10 degradation and excessive costs can be fatal. 



The present invention is further described in the detailed description which 
follows, by reference to the noted drawings by way of non-limiting exemplary 
embodiments, in which like reference numerals represent similar parts throughout the 
1 5 several views of the drawings, and wherein: 

Fig. 1 shows a high level description about the input and output of a behavior 

expert; 

Fig. 2 describes in more detail the internal structure of an behavior expert and the 
relationship between the behavior expert and outside world; 
20 Fig. 3 shows a high level block diagram of a local service management system; 



BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 4 illustrates the internal organization of a behavior expert; 



Fig. 5 shows the process of behavior analysis within a behavior expert; 



Fig. 6 illustrates the relationship among GDS, the BeXs, and the GDOs; 



Fig. 7 illustrates how the data move moves from data providers to the rules and 



25 ultimately triggers events; 
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Fig. 8 shows the organization of BeX variables; 

Fig. 9 shows a high level organization of a local service manager and the BeX 
compiler; 

Fig. 10 shows how a behavior rules generates an event based on states; 
5 Fig. 1 1 illustrates how different BeXs can be linked based on a variety of internal 

controls; 

Fig. 12 illustrates fundamental concepts of building a dependency network; 

Fig. 13 shows an dependency relationship is created by sharing controls among 

BeXs; 

10 Fig. 14 describes exemplary topologies created by dependency relationships; 

Fig. 15 illustrates ways of building complex, multi-tiered analysis systems; 

Fig. 16 illustrates how BeXs share information through the use of a blackboard 

server; 

Fig. 17 is a general block diagram for the adaptive feedback mechanism; 
1 5 Fig. 1 8 illustrates a more detailed diagram, in which Adaptive BeXs are used for 

adaptive feedback control; and 

Fig. 19 shows an example of adaptive feedback control. 

DETAILED DESCRIPTION 

20 An embodiment of the invention is illustrated that is related to behavior experts to 

be used in an eService management system. The present invention enables intelligent 
eService management by incorporating the knowledge about eService business process 
into behavior experts at different levels of the eService management in a distributed 
fashion so that the eService business model dictates the infrastructure management 

25 strategy to ensure eService delivery. 
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A Behavior Expert (BeX) is a distributed, autonomous intelligent agent in a 
eService management system, designed to detect, analyze, predict, and control certain 
behavior of the components of a business infrastructure that supports an eService. A BeX 
may be attached to a component (or application) of an infrastructure that supports an 
5 eService so that the operational status or the behavior of the component may be 
dynamically monitored and adaptively adjusted to optimize the eService performance. 
Fig. 1 illustrates a BeX. 

In Fig. 1, a BeX is attached to an application or component. A BeX may analyze a 
wide spectrum of sensor data from a set of data providers (acquired from the components) 

10 and make decisions about the behavior of the components. Component behavior is 
detected based on a collection of rules. Fig. 2 shows in more detail the construction of a 
BeX and its connections with other parts of an eService Management System. In Fig. 2, 
observation data acquired by data providers are sent to a General Data Server. GDS 
generates generic Data Objects. A BeX is constructed based on a set of variables, states, 

15 events, and rules. Rules use variables as their basic building blocks. These variables are 
populated from GDOs generated by GDS (which, in turn, receives data from data 
providers). When abnormal behavior is detected, the BeX generated a set of events that are 
formatted in Uniform Data Model. Such events may be shared with other BeXs through a 
blackboard server where the BeX may post its events. 

20 Rules employed by a BeX may be non-procedural and are used by the BeX's own 

inference engine to assemble evidence in order to pursue goals. Each BeX implements a 
model of application metrics. A collection of BeXs may interact with one another in a 
dynamic fashion and together it comprises a model of operational and performance 
metrics for the complete system. Such interactions form flexible topologies among BeXs 
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to enable multi-tiered, aggregated, and meta-analytic analysis under a multi-stage BeX 
architecture. 

There may be various kinds of BeXs. For example, a BeX can be a physical (or 
coupled) BeX, a logical BeX, or a functional BeX. A physical or coupled BeX is attached 
5 to a running component in the infrastructure. This may be the most common form of a 
BeX. A physical BeX is a behavior module that tracks and responds to the changes in 
performance of the running component through a series of metrics. 

A logical BeX uses the information from other BeXs to analyze the performance of 
a component collective. The • dependency on the information from other BeXs may be 
10 described by a dependency tree and a logical BeX may correspond a node locus in the 
dependency tree. 

A functional BeX is a behavior expert that specializes in a particular task. It acts 
like a small program and is called from another BeX through one of two methods: a 
traditional CALL <BeXid> or the functional form: <x> = BeXid(parmi,parm 2 ,...,parm n ), 
15 where BeXid is used to uniquely identify a BeX and parmi,parm 2 ,...,parm n . are the 
parameters passed to the functional BeX. Functional BeXs provide repositories of 
distributed and shared knowledge, establish business process modeling support, and 
encapsulate service policies within the organization. 

A BeX can perform a variety of actions, including sending events (messages) to 
20 other BeXs, to local and global intelligence centers of the eService management system, as 
well as directly to the eService management front-end. They play a pivotal role in an 
eService management system and implement the business process modeling support for 
the eService at various levels of details. 
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A BeX attached to a particular running component in an e-service infrastructure 
performs behavior analysis through a collection of variables and rules that are associated 
with the running component. A BeX acts on the variable, usually instantiated by the 
observations acquired from the running component and detects any behavior that is not 
5 acceptable according to rules. Such a BeX contains not only a dictionary of the variables 
but also the transitional states and the rules that carry out the transitions from variables to 
states and from states to events. A rule defines some violation of acceptable behavior and 
may take the form: 



10 where "name" is the identifier of a particular rule, "if-premise" describes a condition, 
"then action" describes the action to be taken when the condition satisfies, and "else 
action" describes the action to be taken when the condition does not satisfy. 

The rules may be designed to enforce the performance requirements, imposed on 
all the running components of an eService infrastructure to support the e-service. 
15 Therefore, the eService business process model, through such requirements, dictates how 
an eService management carries out its tasks by making all levels of eService management 
(including BeXs) aware of the purpose and the impact of the infrastructure with respect to 



20 system. Data providers send observation data to the General Data Server. GDS makes 
observation data accessible to the BeXs in a uniform object form (Generic Data Object). 
Each BeX is connected to a General Data Server from where the observation data from 
infrastructure components can be accessed. Each BeX acts upon the observation data, 
updates states of the running components that it is managing, and generates events. Such 



name, if-premise then action else action; 



the eService. 



Fig. 3 shows how BeXs interact with other parts of an eService management 
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events are the results of behavior analysis performed by the BeX and reflect the behavior 
of the running components. 

BeXs communicate with the outside world through a family of coherent, well- 
formulated events. These events combine explicit rule-centric information with implicit 
5 information available to the BeX in its current state. Events are either shared among the 
BeXs in the same system or routed into the Local Ecology Pattern Detector. This 
mechanism either dispatches the event directly to the eService layer (if it has a very high 
priority) or it absorbs the event and uses it to build clusters of emerging patterns. 

Both singleton events (those that pass directly through Local Ecology Pattern 
10 Detector) and composite events (those that represent ecological patterns and detected by 
Local Ecology Pattern Detector) are routed through the dispatcher and stored in a Global 
Data Repository. These events are read by the integration BeX using the database data 
provider pipeline. The iBeX incorporates high level intelligence to compute a service level 
indicator. 

1 5 We note that there are, in fact, two kinds of Events that can reach the eService 

Manager from the local service manager (the internal, machine resident collection of 
BeXs). The first is any event with a high priority. Such event will be routed through to the 
eService Management front end. The second is any event that has been initiated by Local 
Ecology Pattern Detector and represents an anomalous pattern. These events are 

20 component or process specific, but, nevertheless, represent an aggregate of many 
individual events. The eService Management front-end may need to distinguish between 
the two. 

Each BeX incorporates (or includes) a collection of data sources. These data sources 
expose all the underlying metrics available on that platform and for the applications or 



-10- 




Panacya Ref: Bex: . ' 

L & A Ref: Ol-PAN-04 

services running on that platform. The data source definition in a BeX defines the tables of 
symbolic names used by the variables to couple themselves to the associated data 
provider. A variable has a set of integral properties: synchronization, data types, name, 
sapling rate, and the data provider's name. 

5 When a BeX receives observation data from GDS, it populates its variables. The 

updates variable values may trigger the propagation of variable values to a set of States. 
Such propagation is achieved through a set of metric rules. Furthermore, the changes in 
States may trigger some events which are detected through a set of behavior rules. Fig. 4 
shows how these parts link together in a BeX. 

10 A metric is a threshold violation. Metrics, as the name implies, measure some state 

of a system (service, component, or application) based on the violation of some specified 
performance criteria (which may be expressed as a crisp or fuzzy threshold.) In a BeX, 
metrics are embedded in the premise of a rule. Metrics combine variable values and 
threshold equations using, for example, crisp or fuzzy operations: 



Each metric is shown in the "If clause. The first rule uses a crisp notation. The 
second rule uses fuzzy sets (veryHigh and Indicated) to describe the threshold states. 

A metric rule is a rule that uses a metric to instantiate (give a value to) a State 
20 variable. A metric rule access metric thresholds through the BeX's variables (which are 
pipelines via the GDS to the underlying data providers). As Figure 8 illustrates, metric 
rules execute to populate States and Behavior rules execute to examine the condition of 
these States. 



15 



If iCPUTime > 80 then CPU ALERT - True; 



If iCPUTime is veryHigh then CPU_ALERT is Indicated; 
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Metric rules can only set the value of a State - they cannot initiate messages 
(events). The frequency with which metric rules are fired is the minimum sampling rate 
for all the variables used in the collection of rules. 

Behavior rules execute to examine the condition of States. Behavior rules may 
5 have a name, combine multiple implicit or explicit States, and generate a result (such as 
sending an event to the eService management front-end). Behavior rules are the core 
intelligence instrumentation in a BeX. A behavior rule can have an execution frequency as 
well as an explicit degree of severity. 



10 the condition of the rule is satisfied. There may be various types of events. For example, 
an event may be one of the following types: hidden, local, or external. The type may be 
determined by the message's visibility. Hidden events are shared among dependent BeXs. 
Local events are shared between BeXs and the Local Ecology Pattern Detector, as shown 
in Fig. 3. External events are routed through the dispatcher to a Global Data Repository so 

1 5 that either the Global Ecology Pattern Detector or the eService management front-end can 
respond immediately to higher priority events. 

As shown in Fig. 5, events are generated by behavior rules in a BeX. The states 
are instantiated by metric rules. Behavior rules act on the instantiated states to perform 
behavior analysis. When conditions of behavior rules are satisfied, the rules are fired to 
20 generate events. 

Each BeX may be responsible for a different infrastructure component but different 
BeXs may share information whenever it is necessary. BeXs may share information in 
different ways. For example, a one-to-many relationship may exist between BeXs. 
Dependencies among BeXs may form complex topologies in a eService management 



An event may be a message generated by a behavior rule as an action taken when 
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system. A common use of the dependency specification is the creation of hierarchical or 
tree structures among BeXs. Other possible topologies can be networks, star patterns, 
acyclical graphs, and plex structures. 

A local eService Management System is the machine resident component of a 
5 eService Management system. A local eService Management System connects behavior 
experts (BeXs) to running applications, tracks the application's performance using metric 
and behavior logic stored in the BeXs, and communicates any irregularities to the eService 
Management front-end. Below, aspects of BeXs are described in details. 



10 information to the General Data Server (GDS). This object handles both synchronous and 
asynchronous data feeds. Asynchronous feeds (such as the OS Filter Driver) use a form of 
push technology to send data. Synchronous feeds are coupled to the Aware monitoring 
software and provide data when called. A scheduler is also used to sample synchronous 
data on a regular basis. Each BeX is coupled to the General Data Server (which is its only 

15 source of data). 

BeXs constitute the molecular structure of the eService Management facilities. 
Each BeX contains four elements: local variable definitions, metric rules, explicit local 
dependencies, and behavior rules. It is the set of behavior rules that actually affect the 
generation of events. As an example, a very simple BeX may look something like this: 



In Fig. 3, Data Providers are platform specific executables that acquire and deliver 



20 



BeXID : 



MyBeX 



CREATED : 



10MAY2000 15:18:12 



COMPONENT : 



C : \SVER0 1\SAP2\ I \SHOES . EXE 



DATASOURCE : 



SOLARIS1, UNIX 



25 



COMMONEVENTS : SHOES . EVE 



// 



STATES { 
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Boolean 
Boolean 



CPUAlert 
DISKAlert 



10 



15 



VARIABLES { 
Synch int 
Synch int 

} 



iCPUUsed sample 10 RM . GETCPUTIME 
iDiskRem RM . GETDI SKLEFT 



METRIC RULES { 

If iCPUUsed > 80 then CPUAlert = TRUE; 
If iDiskRem < 20 then DSKAlert = TRUE; 

} 

BEHAVIOR RULES { 

MyRulel. [Freq 10] If CPUAlert and DskAlert then 

Send Event (bAware, a CRITICAL, INSUFFICIENT- RESOURCES, ScAppName) ; 
End if 

} 

20 

The VARIABLES section defines working variables. We need these as "handles" 
so that we can rules. This section indicates the data type, the variable name, and the data 
source. The easiest way to identifier the source is simply to specify the GDS method that 
we could normally use to acquire the data. I can imagine that more advanced features of 
25 this section could include sampling rates, filters, arithmetic and logical expressions, and so 
forth. 

The METRIC RULES actually handle the evaluation of thresholds. Metric rules 
generally implement operational dependencies for the application as well as additional 
metrics that might be added to observe the application's behavior. The premise described 
30 in a metric rule may contain any number of conditions connected by AND or OR to form 
complex logic. These rules may handle both numeric and string data. String operators 



-14- 



Panacya Ref: Bex: 
L&ARef: 01-PAN-04 

include such functions as HAS, CONTAINS, OMITS, LEFT, RIGHT, SUBSTRING, 
ISIN, TRIM, UPPER, LOWER, FIRST, LAST, THISWORD, GETWORD. The ability to 
handle string information is important for metrics that look at log files (with these 
operators we can see if a log file line contains or omits some value, as an example). 
5 The action described in a metric rule is used to set a State Variable. A state may be 

a Boolean state and can be set to the constant TRUE or FALSE. The default for a State 
Variable may be set as False. These states are the conditions normally used in the behavior 
rules to decide whether or not to generate an event. 

The BEHAVIOR RULES generate events. In this section we combine the states 

10 from the metric rules section. Note that the values of the original variables (and any 
defined but not used in the METRIC RULES section) are still available in this section, 
thus allowing these rules to use a wide spectrum of data. The actual event generated 
depends on the logic of the rule. An event is derived from a more general and flexible 
message class. This means that it can take a variety of forms. In the previous example, we 

15 have the target receptor for the even, the criticality, the event class, and the event message 
(which in this case is the name of the application inserted as a built-in symbolic variable). 

The Behavior Rules section also supports complex logic. Aarbitrarily nested if- 
then-else rules may be allowed. These rules may normally issue the Send Alert action. 
However, there is no reason that the result of a rule evaluation could not be the execution 

20 of a script, the acquisition of additional data, a change to thresholds (thus we have 
rudimentary adaptive BeXs), or even the investigation of another BeX's status (how is 
unknown right now). 

When eService Management System discovers an application (or knows that an 
application is connected), it creates a BeX. The BeX takes the native BeX object, compiles 

25 the rules, handles dependencies, thresholds, and data connections, and produces an 
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executable policy that is linked into eService management System's internal Directory of 
active BeXs. The eService Management System cycles through the BeX objects at some 
sampling rate. During each cycle all the rules are executed. When some threshold is 
exceeded (actually, when some rule initiates an action), an event of some class and type 
5 can be generated - whether or not an event is dispatched depends on the Behavior Rules 
logic 

A BeX contains two central and inter-connected rule dictionaries. The first rule set, 
METRIC RULES, defines threshold violations and set global state variables. The second 
rule set, BEHAVIOR RULES, interprets the state variables established by the Metric 

10 Rules and take some action (such as signaling an event notice back to the eService 
Management System's front-end). Figure 8 shows the flow of control logic in the two rule 
sections and illustrates how they are functionally connected. 

Metric Rules may be optional (but necessary) components of the BeX. They 
identify and name specific states within the system. These states have, by default, the 

1 5 External attribute. As we will see later, the implicit dependency relationships among BeXs 
may work in several modes: variable, states, and rules. In general, the tiering of 
application performance BeXs is done through the illumination of fired rules in 
subordinate (dependent) BeXs. 

Each Metric Rule is connected to a single General Data Server (GDS) pipe. These 

20 pipes are either synchronous or asynchronous (the difference is discussed later). A BeX is 
compiled and installed in the system as an active, event-driven object. It is attached to the 
General Data Server through a registration process that identifies its data needs. The set of 
declared data provider requirements is automatically updated by the GDS based on the 
sampling or refresh rate of the connection. Figure 9 illustrates the relationship between 

25 the GDS, the BeXs, and the General Data Object (GDO). 
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The registration process allocates a group of slots (linked nodes) in the GDS that 
are allotted to and connected to the BeX instance. These nodes will contain the data 
packages from the GDS that correspond to the data elements requirements during 
registration. Once connected to the GDS, a BeX listens to the data input buffer (the 
5 collection of GDO instances) for data packages that belong to one of its rules. When 
found, the items are read and. the GDO at that slot position is cleared (deallocated). 

Data flows through a BeX are initiated by the call back methods associated with 
the BeX's variable dictionary. The General Data Server triggers these methods. Figure 7 
illustrates how the data moves from the data providers through the metric and behavior 
10 rules and ultimately can trigger some event. 

When a behavior rule emits an event, we check with the sampling filter to see 
whether or not the event should actually be transmitted. If, as an example, if the rule has a 
clock filter of 10:6 (10 times in six seconds) then we look at the time since the last event 
and the count of event generations. If the count is greater than or equal to six and ten 
1 5 seconds have elapsed we send the event, otherwise we simply increment the event counter 
and leave (no event is sent). When an event is sent, the elapse time and the event counter 
are reset to zero. 

Behavior rules in a BeX are generally not point-in-time productions. Rather, they 
analyze changes in the application state. These changes are reflected as periodic threshold 
20 violations, average violation quantifiers, or some increase or decrease in the change (that 
is the rate of change or the degree of change). We might have rules that use antecedent 
expressions like, 



If X > 1 on a regular basis 



25 



If X is increasing 
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If X is rapidly increasing 
If delta(X t ,X t _0 is large 
If X occurs M times in N time periods 
If avg(X) is above threshold 

5 

Further, the data returned from the data providers is often not a single (scalar) 
value. Instead, the data provider can return a vector of values. As an example, the resource 
DiskSpace would returns a vector of 2 tuple value, one tuple for each disk: 
((C,201)(D,97)(E,2065)(J, 16701)). The cardinality of the vector depends on the number of 
10 disks attached to (or visible to) the data provider. This means we will have rules that 
implicitly or explicitly loop over these vector elements. This means a variable definition 
such as, 

Synch int iDiskSpaceO sample 80 dp_nt_rm. diskspace 

15 

Produces a multi-dimensional variable. Our rule language must not accommodate 
an access to these implicit arrays (or matrices). Rules now can take on such forms as: 




For iDiskSpace.DiskldfCYD") 
20 If iDiskSpace.DiskCapacity < 80 then 

Sendevent(Urgent, n Out of Disk Space"); 

End for 



In order to accommodate these kinds of rules, the Behavior Module incorporates 
25 two interconnected features: a sampling clock and a matrix of variable data (instances and 
their historical values). With these two capabilities we can form rules that measure the 
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change in a variable from state to state. This allows the rule handler to detect and predict 
the state of the system in the current as well as future periods. This capability is also 
crucial to the workings of the adaptive feedback system. 

In order to implement this kind of analysis, we need to introduce a collection of 
5 time (or vector) functions into the behavior and metric rules. This necessitates a 
fundamental change in the way we visualize a variable. Variables are now multi- 
dimensional time series or linear vector objects with a chronological array of data 
elements. The horizontal axis holds the historical data. The vertical axis is the instance 
axis. The dimensionality of horizontal axis is modulo-N, where "N" is the time horizon. 
10 We can isolate a particular instance row with the for keyword (the extended form of which 
includes foreach, forany, and forall). We can index a variable along the horizontal axis 
with the built-in time index (t) or we can use one of the time access functions to abstract 
statistical information about the data. Figure 8 illustrates the organization of a BeX 
variable. 

1 5 Many of the functions involve a moveable time horizon window. This window is 

specified in terms of the timeoffset and the periods parameters. We specify the start of the 
data value as a timeoffset (thus, 0=Tnost recent or current data value, l=the last or previous 
value, 2=the one immediately before the previous, and so forth.) The timeoffset is in the 
form of a time expression (texp). Thus, t-0 is the current period, t-1 is the previous values, 

20 etc.) The texp can be any arithmetic expression that produces a number in the range [0,N- 
1]. The periods parameters indicates how many periods are used in the calculation. If 
omitted, the remainder of the time periods starting at the timeoffset is used. Thus, if a 
variable X has 10 time periods, the expression avg(X,t-4) uses periods [5] thru [9] all time 
series are zero based.) 
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Avg avg(varid {.timeoffset {.periods}) Computes the mean or average of the data 

vector. 

Count count(vtfn'd {.timeoffset}) Returns the count of the actual number of values 

in the lag data vector beginning at any timeoffset. 

5 Frequency frequency(varid.exp {.timeoffset} {.periods}) Returns the number of times 
the value indicated by exp appears in the lag data vector. 



10 



Last 
Max 
Maxfreq 

Median 
Min 

Minfreq 



15 Mode 



Previous 



\&st{varid {.periods}) Returns the last data value in the time series. 

max(varid {.timeoffset} {periods}) Returns the maximum value. 

maxfreqfyarid {.timeoffset} {periods}) Returns the number of times the 
maximum value in the series appears in the lag data vector. 

median(varid {.timeoffset} {.periods}) Returns the mediam data value. 

min(varid {.timeoffset} {.periods} {.threshold}) Returns the minimum value. 

minfreq(varid {.timeoffset} {, periods}) Returns the number of times the 
minimum value in the series appears in the lag data vector. 

modQ(varid {.timeoffset} {.periods}) Returns the Mode of the data 
distribution (note that this is a relatively expensive operation since the lag 
data vector must be sorted.) 

piQvious(varid {.timeoffset}) Returns the previous value from the lag data 
vector beginning at any specified timeoffset. 



20 Regularly reg\x\arly(vexp {.percentage} {.timeoffset} {.periods}) Returns a Boolean 
indicating whether or not the variable expression (vexp) when evaluated 
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against each of the historical values occurs more than the indicated 
percentage. As an example, the function, 

regularly(iCPUUSE>80,50) 

returns true if the ICPUUSE value in each time period is greater than 80 in 
5 50% of the cases. The percentage can be used to implement such semantics 

as occasionally (>10), often (>25), frequently (>40), usually (>50), mostly 
(>75), nearly always (>85) and always (>97). Naturally these numbers are 
model dependent and only given as an example. 

Sdiff sd\ff(varid{Jimeoffset} {.periods}) Returns the sum of the differences 

10 between the lag data vector values. 

Strend strend(varid{,timeojfset} {.periods}) Returns the slope coefficient for the 

series in the range [-1,+1]. This is degree to which a polynomial least-ques 
regression line has a positive or negative slope. This is a predictor function. 

Var[t] varid[texp] Explicitly selects a cell in the lag data vector using a texp. All 

1 5 variables have a default selector of var[t], that is, the current value. 



The rule handler also provides several built-in values that describe the current state 
of the variable and its time series. In some cases these can be used to re-adjust the state of 
the variable. 

20 

var.periods (function) returns the total number of time periods associated with the 
variable. 
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Var.time var.time( {begin} {,end} {, slice}) A directive and a function. Returns the 
current timeoffset associated with the variable. As a BeX directive, this also 
changes the time horizon used by all the rules that access the associated 
variable. Thus X.time(l,5) restricts all functions to time period 1 (the 
5 previous data) out to time period five. However, X.time( 1,20,2) restricts the 

variable to periods 1 thru 20 with a step function of 2 (that is, every other 
value). X.time(BEGIN) returns the start. X.time(END) returns the ending 
period. These are built-in keywords. X.time() restores the time horizon to 
its default values. (We need to see if this kind of time control is really 
1 0 necessary before implementing such a complex control mechanism). 

The rule architecture is consequently affected by this change in the variable 
structure. Rules must be able to exploit the higher and richer dimensionality of the 
variables. A rule must also be able to isolate regions within the underlying instance and 
15 lag data space. Thus, rules become more script-like in their organization, allowing the 
designer to loop over the horizontal and vertical axes, perform flow of control operations 
(for, while, if, until, and do), access elements through subscripts, isolate sub-matrices 
(with, step, by). We can write a rule such as, 



20 With iDiskSpace.DiskId(&ThisApp.ResidentDiskId) 

{ 

if iDiskSpace.DiskCapacity < 80 then SendEvent(); 
if iDiskSpace.PctFull > 90 then SendEvent(); 

} 
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Note that the first sub-rule sends an event when it finds a disk with a storage 
capacity of less than 80 Megabytes (not the available remaining space, which would be the 
method, RemainingSpace). Rule work with implicit looping over any unrestricted 
dimension. Thus, the statement, 

if Avg(iDiskSpace). . . . 



Takes the average disk space of the entire NxM data matrixr On the other hand, a 
statement such as, 

10 

For iDiskSpace.Diskldf'C") 
if Avg(iDiskSpace) .... 

Computes the average disk space only for the C: disk. And, 

15 

For iDiskSpace.DiskCapacity >50 
if Avg(iDiskSpace) .... 



Computes the average disk space for all the disks with an available capacity of 
20 over 50 Megabytes. 

To implement this feature Variables need an historical array of data. Rules also 
need a frequency histogram (or other such pattern recognition feature) to record the 
number of events issued within their frequency time frame. The predicate clock or 
sampling calculation must also match both the frequency of the rules and the sampling rate 
25 of the variable 
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Figure 9 shows the high level organization of a local service manager and the BeX 
Compiler (which generates the BeX objects stored in the local service manager's 
directory). 

The implicit information comprises the BeX identification, the Node (location of 
5 service), the date/time stamp, the rule identifier, and the rule's degree of truth. Explicit 
event information provides categorization and classification information necessary to 
aggregate or summarize information. Each event has four related attributes: 




The fundamental type of the event. This can one of the following 
symbolic constants: MAINT, PERFORM, and INTERNAL, The 
Maintenance Group specifies events that are not related to the issues 
of performance (thresholds and metrics). The performance Group is 
the principle event family dealing with behavior violations and 
notices. This is the Group of events that is principally intercepted by 
LECO and also used by the eServices Manager to control the 
display of status in the model hierarchy. The Internal Group are 
events that are intended for dependent BeXs within the same 
machine or server environment. 

20 Class The class of event information. There are five intrinsic (built-in) 

classes: OS, APP, SYSTEM, NET, XTRAN. The user can define 
additional event classes. 

Measure Within the class, the type of measure. There are six intrinsic (built- 
25 in) measures: AVAIL, VOLUME, RESPTIME, TRANRATE, 
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THRUPUT, and FAULTS. The user can define additional event 
measurement types. 



Specificity The ClassxMeasure couplet can also be qualified according to its 
analytical specificity. There are two possible values along this axis: 
QUALITATIVE and QUANTITATIVE (also specified as WEAK 
and STRONG). Specificity is also a factor in the use of fuzzy rules 
indicating the possible degree of elasticity in the model 
measurement. 



The following matrix shows the relationship between Class and Measure. 
Although these are organized in a matrix, not all relationships might be valid « this might, 
as an example, be particularly true of the SYSTEM (the system call driver data stream). 

Measure 

Class AVAIL VOLUM RESPTIM TRANRA TRHUP FAULT 

EE TE UT S 

os r t~ — i : ~ 

APP ~~~ f * 

SYSTEM " 

NET " ~" 

XTRAN t " " ' 1 1 



The fundamental characteristics of an event is specified in the Events section of the 
BeX. Each BeX can establish its own vocabulary of events and it can also include a global 
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or shared definition of common events. A collection of global or commonly shared events 
can be specified in the CommonEvents section of the header. Like the DataSources 
specification this statement indicates a collection of previously defined and shared event 
definitions. To declare an event, we give it a unique name and declare its properties. The 
5 general syntax is: 

Events . 

eventld Group, Class .Measure, Specificity "message" 

10 

where message indicates the information string that is transmitted with the event. If a 
message text is not specified, it can be included in the actual event action of the rule (see 
blow). With this kind of declaration we can then use the SendEvent action of the behavior 
rule language to complete an event and send it to either the eServices layer or to the LECO 
1 5 pattern organizer. The SendEvent action has the following general syntax, 

SendEvent(evett//d, priority, severity {.message}) 

Where, 

20 

eventld (string) The identifier of the event as it occurs in the Events section 

of the BeX (or as it occurs in the CommonEvents header include 
file.) Only events that have been defined in these two regions can be 
transmitted by the SendEvent rule action. 

25 
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priority 



(integer, [0,10]) is the urgency of the event. The smaller the 
number, the higher the priority. This event parameter affects the 
way the event is handled by the local ecology system. A priority of 
zero (0) is automatically routed directly through LECO to the 
eService database for immediate action. All other priority events are 
held by LECO where they are classified and used in the emerging 
pattern analysis (where priority plays an important role in the way 
patterns are interrelated.) 



10 



15 



20 



25 



severity (float, [0,1] or [0,100], psychometric scaling) The degree of 
"damage" associated with this event (used primarily with 
PERFORM group events, but not restricted to this group). The 
severity is a measure along the psychometric scale of the impact this 
rule firing has on the performance of the associated component (or, 
for a logical or virtual BeX, on the performance of the composite 
system.) With the addition of fuzzy behavior rules in the near future 
this severity will be the product of the defuzzified solution scalar 
vector and the degree of evidence in the solution (the compatibility 
index of the solution fuzzy vector). 

message (string) A text message describing the event. If a message was not 
declared with the event, it can be added as a parameter in the action. 
If a message exists with the declared event, this message will 
replace the defined text (unless the text starts with a plus sign, 
in which case it is appended to the declared text.) 
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When a behavior rule is fired (its premise or antecedent conditions are true), we 
can send an event notice to the Aware front-end. This notice is used by such functions as 
eServices Manager to illuminate system problems. In addition to the explicit information 
5 associated with an event (the combination of the event declaration properties and the 
SendEvent parameters), the event also contains a compacted collection of internal or 
implicit data. This is provided automatically by the SendEvent operation. The following 
layout provides a complete description of the emitted transaction. 

10 Semaphore (Byte) Indicates the aggregation methodology used for this event. 



Time 



(string) The date/time stamp for the event. This maintains 



chronological order in the bDB database. The data in the form 



yyyymmddhhmmss. 



15 



Node 



(integer) Location of the service 



Component (string) The application throwing the event. This is the name of the 



application monitored by the Behavior Module. 



20 



BeXId 



(string) The identification of the BeX throwing the event. An 



application may have multiple BeXs, thus we need to know exactly 



which BeX is reporting this event. 



25 



Group 



(integer) The symbolic constant value. 
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Class (integer) The symbolic constant value. 



Measurement(integer) The symbolic constant value. 



10 



Priority (integer) 

Ruleld (string) The identification of the Rule in the BeX that fired (or 

didn't fire - see next column of data). 

Severity (float) 



Degree 



15 



20 



Data 



(float, [0,1]) Used with fuzzy inferencing. Reflects the degree of 
evidence used to develop the Severity level. 

(string) A package of data associated with the rule execution. This 
is the expanded rule buffer. By "expanded" we mean that the value 
of the variables are encoded with the rule. Thus, for a rule fragment 
such as, 

if iCPU Avail < 100 then 

the Buffer would contain, 



25 



if iCPU Avail {80} < 100 then 
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in this way the receiving interface, if necessary, can parse out the 
actual values that triggered the rule. Braces are used since these are 
not valid lexical elements in the rule syntax. 

5 

A reference to a GDS pipe is through a locally defined and explicitly typed 
variable. A variable can be dynamic (dv), external (dx), or static (sv). Static variables have 
values that persist through subsequent executions of the behavior Module (and thus can be 
used as accumulators or for other kinds of global control). Variables are explicitly defined 
10 before any of the Metric rules. A variable definition has the form: 




(PushType) StorageType DataType Varld (SampleRate) GDSpipe 

Where: 

15 

PushType is the availability scheduling mechanism associated with the 
variable. This can be Synchronous or Asynchronous (or Synch and 
Asynch). If not specified then Synchronous is assumed by default. 

20 StorageType is the optional storage class designator (Dynamic, External, Shared, 

or Static). If not specified, Dynamic is assumed. A Dynamic 
variable is local to the BeX and is always attached to an data 
provider source through the GDS pipeline. A Static variable is local 
to the BeX, is initialized when the BeX is compiled and loaded, and 

25 is generally not attached to a data source. An External variable is 
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visible to all the BeXs in the active Aware system. An External 
variable can be referenced in another BeX, however, it must have 
the Shared data type designator in all but the original BeX. 

DataType is the type of data this variable can hold. The variable types can be 
integer (int), string, float, double, or Boolean. Each variable must 
have an explicit data type specification. 



Varld is the name of the variable. The name can be one to thirty-two 

10 characters in length (and must start with either the underscore or an 

alphabetic character). Variable names are not case sensitive. Each 
variable name in the BeX must be unique. If the variable name has 
the external data storage attribute, it must be declared as shared by 
all other BeXs except the one where it is originally defined. A static 
1 5 and external variable can also have an initialization value (this value 

is assigned only once - when the BeX is compiled and loaded). 
Only static and external can be used together. 



SampleRate is the rate at which synchronous data variables are populated. 

20 

GDSpipe is the data source descriptor. This string defines the complete 
General Data Server method declaration that is used to retrieve a 
data package (parcel) from the target data provider. 
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Any number of variables can be defined in a single BeX. Many variables can have 
the same GDSpipe specification. A locally defined dynamic variables can only be used in 
the antecedent of a metric rule (state variables appear in the consequent or action part of 
the rule). The definition of variables is indicated by the VARIABLES keyword in the BeX 
5 definition file (not case sensitive). As an example, 

VARIABLES { 

int iCPUUsed (1000) RM . GetCPUTime ; 

Static int iTimesExecuted=0 ; 

10 } 

A metric rule instantiates a state variable. State variables are defined in the State Context 
section of the Behavior Module. The collection of instantiated state variables is used in the 
behavior rule section. State Variables (or simply States) are explicitly defined before any 
15 of the Metric rules. A state variable definition has the form: 

StorageType StateType Varld [=InitState] EventThreshold 



20 



Where: 



StorageType is the optional storage class designator and can take on the same 
properties as the locally defined working variables (Dynamic, 
External, Shared, or Static). If not specified, Dynamic is assumed. 

25 StateType is the state of the variable. This can be Boolean, Enumerated, or 

Fuzzy. Only Boolean states are available in the first release of 
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Aware. If a StateType is not specified, then Boolean is assumed by 
default. 



Varld is the name of the state variable. The name can be one to thirty-two 

5 characters in length (and must start with either the underscore or an 

alphabetic character). Variable names are not case sensitive. Each 
state variable name in the BeX must be unique. If the state name has 
the external data storage attribute, it must be declared as shared by 
all other BeXs except the one where it is originally defined. A static 
10 and external state can also have an initialization value (this value is 

assigned only once - when the BeX is compiled and loaded). Only 
static and external can be used together. 

InitState The initial or default value of the state. If not specified, then FALSE 
1 5 is assumed. 

EventThreshold The clock as well as sampling density necessary to actually 
affect an event transmission. As an example, we might have an 
event threshold of (10,60sec) meaning that this state variable must 

20 be activated ten times in a sixty second period in order to actually 

transmit an event outside Aware. On the other hand, our threshold 
might be a simple sampling density (26), in which case the 
threshold represents the average of the past 26 values. Note that a 
sample of (1) is equivalent to a clock of (1,0) meaning that the state 

25 variable is activated once in any time period. This is equivalent to a 
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point sample. If an EventThreshold is not specified then (1) is 



assumed as a default. 



State variables provide the connection between metric rules and the behavior rules. 
5 Generally, behavior rules operate on the instantiated values of the state variables 
established by the metric rules. A collection of state variables declared in a BeX could 
appear as: 



15 Attempting to define an initial value to a shared state is an error. Since the values of the 
state variables are not established until all the metric rules have been executed, the default 
value of a state variable can also be the value of a locally defined or shared variable. 
Aware will perform automatic type casting between variables assuming the data types are 
translatable. 

20 Metric rules assess the state of the application by evaluating data values against a 

collection of thresholds or intervals. Metric rules provide a form of mapping between 
thresholds and State variables (or simply States). This is illustrated schematically as, 



STATES { 



boolean 



CPUAlert ; 



DSKAlert= FALSE ; 



Shared 



SystemFault ; 



-> S. 



t 2 -> S 2 



n n 
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This mapping is done through a collection of procedural rules (by procedural, we mean 
that the rules are executed in a linear fashion, starting at the first rule and stopping at the 
last rule.) Each rule that has a true predicate initiates a state variable assignment. A metric 
5 rule is in the form: 

Ruleid if < Varld rel exp> [and|or] .... then <s Varld [=|is] sexp> 

Ruleid if < Varld rel exp> [andjor] .... then do; 

1 0 <5- Varld [=|is] sexp> 

<Rule> 
end if 




15 Where 

Ruleid is the unique identifier for this rule in the Metrics section. The rule 
identifier can be omitted if not needed (in which case the metric rules are 
labeled serially. This means that the first metric has a Ruleid of Ml, the 
20 second has a Ruleid of M2, and so forth.) 

Varld is the name of variable defined in the Variables section of the BeX. The 
name can be one to thirty-two characters in length (and must start with 
either the underscore or an alphabetic character). Variable names are not 
25 case sensitive. 
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rel is a relational operator. This is any of the graphic or lexical 
representations of the Boolean relationals: equals, less than, greater than, 
less than or equal, greater than or equal, contains, omits. The word not 
can be used to generate the complement (as well as the graphic and 
lexical for not equal). 



exp is a predicate expression involving either arithmetic, logic, and string 
operators, constants, functions, or other variable names. Normally this is 
the constant or variable value associated with some metric. 

And|or is a logical connector between multiple antecedent expressions. Any 
number of expressions can be coupled to form a valid antecedent to a 
metric rule. This is often needed when a state variable is dependent on the 
condition of two or more data values (such as CPU consumption and disk 
space availability). Parentheses are used to specify the order of evaluation 
(which is normally left to right). 

SVarld is the name of a unique state variable in the BeX. Each state variable is an 
extended or interval Boolean variable that is normally assigned the value 
TRUE or FALSE (these are built-in Aware states). 

Rule is a nested rule within the do... end block. This rule has the same syntax 
as the top-level rule. 
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Metric rules form the foundation logic of the application management policy. They 
compare the current state of an application's behavior as well as selected environmental 
conditions against minimum or maximum or desirable thresholds (or ranges). When a rule 
antecedent expression is true, the then part (or consequent) of the rule is performed. The 
5 consequent set or instantiates the value of one or more state variables. Note that a rule can 
set multiple state variables or it can apply nest conditionals by enclosing the collection in a 
do. . .end block. As an example, 

METRIC RULES { 

10 If iCPUUsed > 80 then CPUAlert = TRUE; 

If iDiskRem < 2 0 then do; 
DSKAlert = TRUE; 

If CPUAlert then StablityAlert = TRUE; 
End if 

15 } 

Behavior rules generally (but not exclusively) work on the pool of state variables 
established by the Metric Rules. In this discussion we concentrate on the use of States, 
however, a behavior rule can also interrogate BeX Objects - variables, states, and rules 
20 contained in shared or dependent Behavior Modules. This use of Behavior rules is 
discussed later in the document. Cast in the form of if-then rules, behavior rules provide a 
functional mapping between collections of states to a unique event. This is illustrated 
schematically as, 

25 f(S i9 S k9 ... 9 S x )->Ej 



-37- 





Panacya Ref: Bex:^^ 
L&ARef: Ol-PAN-04 



The purpose behind behavior rules is simple: analyze the collective state of the system and 
threw an event if the state is outside the performance model established by either a single 
behavior rule or a set of behavior rules. As Figure 10 illustrates, the behavior rules 
synthesize a set of states into an analysis of over-all performance and send an event when 
5 the performance is at variance with the prescribed behavior. 

Like the metric rules, evaluation is done through a collection of procedural rules 
(by procedural, we mean that the rules are executed in a linear fashion, starting at the first 
rule and stopping at the last rule.) Each rule that has a true predicate initiates a possible 
set of actions. A behavior rule is in the form: 



10 



Ruleid [Frequency =f 9 Severity=«] 



if <BeXObject rel exp> [and|or] .... then <action> 



Ruleid [Frequency = yiSeverity=w] 



15 



if <BcXObject rel exp> [andjor] .... then do; 



<action> 



<Rule> 



end if 



20 



Where 



Ruleid 



is the unique identifier for this rule in the Behaviors section. Each 



behavior rule must have an associated rule identifier. This rule 



identifier is used in the automatic tracking facility, the agenda 



25 



manager, and the event protocol dispatcher. 
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Frequency is an integer value in the range [0,n]. Where "n" can be an 
arbitrarily (but not unreasonably) large number. The frequency 
attribute indicates how often, in seconds, the rule will be fired. Thus 
5 freq=10 indicates that the rule is fired every ten seconds. When 

freq=0> the rule is fired continuously. When freq— 1, the rule is 
disabled. 

Severity is a rating between [0,1]. Zero indicates an information level rule 
10 only. A one indicates a rule reflecting a fatal condition in the 

application (or a condition that can lead to application instability). If 
not specified then [.5] is assumed. bAware aggregates the severity 
level of incoming rules form the same BeX. 



1 5 BeXObject Any local Variable (a Varld) or any properly qualified object drawn 

from the dynamic pool of active Behavior Modules (all the related 
Behavior Execution Modules or BeXs). Generally, for a self- 
contained BeX, the object is a Varld the name of any variable 
defined in the Variables section or any state variable defined in the 

20 State section of the BeX. Although the Behavior Rules are intended 

to access the state variables (and thus focus on the performance of 
the application or system), they are capable of interrogating any of 
the variables defined in the BeX. 
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rel is a relational operator. This is any of the graphic or lexical 

representations of the Boolean relationals: equals, less than, greater 
than, less than or equal, greater than or equal, contains, omits. The 
word not can be used to generate the complement (as well as the 
5 graphic and lexical for not equal). 



exp is a predicate expression involving either arithmetic, logic, and 

string operators, constants, functions, or other variable names. 
Normally this is the constant or variable value associated with some 
10 metric. 

And|or is a logical connector between multiple antecedent expressions. Any 

number of expressions can be coupled to form a valid antecedent to 
a metric rule. This is often needed when an action is dependent on 
1 5 the condition of two or more data values (such as CPU consumption 

and disk space availability). Parentheses are used to specify the 
order of evaluation (which is normally left to right). 



action is the result of evaluating and executing a true rule. Unlike the 

20 metric rules, which can only set the value of a State variable, the 

behavior rules can perform a variety of actions. Some of the actions 
include, 



25 



SendEvent Forms and transmit a general event message to the designated 
receptor site. Sending an event is the principal type of action 
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employed by the behavior rules and the general method of 
communicating with the outside world. The first parameter in the 
SendEvent action indicates the intended receiver. This is used to 
discriminate between hidden, local and external event patterns. 
5 Thus, 

SendEvent (Netscape ,....) 

Sends an event to the Netscape BeX on the local machine. Since 
10 this is a BeX-to-BeX communication, the event is automatically 

hidden. 

SendEvent (LECO_NTl ,....,) 

15 Sends an event to the local ecology scheduler on the current 

machine. This generates a Local event. A local event might be 
stored and forwarded by the target LECO. 

SendEvent (bAware ,....) 
20 SendEvent (GECO_SOLARIS8 , ...) 

Generates and sends an external event to either the bAware front 
end or to the designated (and remote) global ecology scheduler. 

25 ApplyRule Explicitly executes the specified rule. 
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WriteLog Writes a line to the Aware audit tracking and logging file. Issues an 
automatic commit. 

AcquireData Connects to the GDS and retrieves another package (parcel) of data. 



ExecScript Runs the named script 

A behavior rule can also change the value of some other variable 
10 through a simple assignment statement. Thus, if the thresholds are 

stored in locally defined (or external) variables, a behavior rule can 
change its own (or another BeX's) policy thresholds (or intervals). 



Rule is a nested rule within the do... end block. This rule has the same 

1 5 syntax as the top-level rule. 

Behavior rules form the core of the application management policy. They integrate the 
states of the metric variables (the state variables) into a logical edifice expressing a model 
of the application's preferred behavior (as one possible example). Behavior rules provide 

20 the policy analyst with the tools necessary to trap anomalous behavior, filter events, 
transmit events into the outside world, and modify its own operation. When a rule 
antecedent expression is true, the then part (or consequent) of the rule is performed. The 
consequent initiates one or more actions. Note that a rule can perform multiple actions or 
it can apply nested conditionals by enclosing the collection in a do... end block. As an 

25 example, 
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BEHAVIOR RULES { 

MyRulel. [Freq=10] If CPUAlert and DSKAlert then do; 
ExecScript "FreeTempSpace" 
5 If ExecScript . Status>0 then do ; 

SendEvent (bAware , aCRITICAL, INSUFFICIENT_RESOURCES , fcAppName) ; 
End if 
End if 

} 

10 

Individual BeXs are connected to an application. They measure the performance of 
the application against a series of baseline metrics. When a metric threshold is violated, a 
state variable is set. The behavior rules examine the collection of state variables to see if 
some action should be initiated (such as throwing an event). The internal state of a BeX 

1 5 (the values of its variables, the condition of its States, the execution status of its rules, and 
the nature of its event schedule) can be shared among other BeXs. As Figure 11 
illustrates, the relationships (or dependencies) between BeXs can be expressed using a 
wide variety of the internal controls. 

Thus, if two behavior modules share a common State (one owns the state variable, 

20 the other has access to its value) they are explicitly linked through this common state. The 
one that shares the state is the dependent BeX, the one that owns the state is the 
independent BeX. Sharing the current value of variables, the state of one or more rules, 
and the type or value of a scheduled event can also entangle behavior modules. And, as 
Figure 1 2 illustrates, a many to one (n: 1 ) dependency relationship can be created through 

25 multiple types of shared objects. 

Figure 12 also illustrates schematically two other fundamental concepts in building 
the dependency network: bi-directionality and multiple dependency points. Bi-directional 
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linkages mean that an independent BeX can also gain access to the control structures 
associated with its parent (dependent) BeX. This has significant implications for 
knowledge modeling as well as mechanizing the adaptive feedback tuner. Multiple 
dependency points simply means that a dependent BeX can be linked to one or more other 
5 BeXs through more than one control mechanism (such as through State variables and 
Rules or State variables and ordinary variables). 

The effector relationship linkages for a dependency matrix are established through the 
dependent or independent BeXs behavior rules. This means that the behavior rules can use 
the shared variables by qualifying the names with the name of the associated (owning) 
10 BeX. As an example, consider the following behavior rule, 

If bl.sl and b3.rl and b3.r4 and not b5.s3 then 
If this.s2 and this.s7 then 

SendEvent(myevent) 

15 End if 

End if 

Which says (in part): if state variable si in BeX bl is true (it was set by the tripping of an 
associated metric rule) and rule rl in BeX b3 was fired and rule r4 in the same BeX (b3) 

20 was fired but state variable s3 in BeX b5 is not true (that is, it's false) then execute this 
rule. This is a nested rule which then says: if local state variable s2 and local state variable 
s7 are true then send an event. The qualification "this** indicates that the object is a 
member of the current BeX. When no ambiguity exists between local and shared variables, 
the "this" qualifier can be dropped (although it is not an error to use it). In order to actually 

25 use shared control mechanisms, the names of the independent Behavior Modules must be 
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specified in the DEPENDENCIES statement of the current (dependent) Behavior Module. 
This concept is discussed below. 

As Figure 13 illustrates, sharing control mechanisms creates an explicit (an 
implicit) dependency among Behavior Modules. In this diagram, a hierarchal or tiered 
5 architecture is created. Each dependent behavior Module interrogates the control 
mechanisms of the BeXs "below" it on the tree. 

The actual architecture (more properly, the topology) of a Behavior Module 
network is synthesized out of the composite control mechanisms shared among the 
Behavior Modules and the ways in which the behavior rules use and set the shared objects. 
10 Figure 14 illustrates some example topologies. 

Each topology represents a type of deployed meta-control architecture. Because 
dependencies are specified only at the parent-child level (not across the entire topology), 
we can easily modify the deployed architecture. This means that BeX topologies can 
evolve from simple to more complex structures as the need arises. Dependency networks 
15 allow us to build Behavior Modules that analyze the states of multiple applications (or 
multiple tasks). Figure 15, as an example, illustrates a tiering of Behavior Modules based 
on State variables. 

In Figure 15, we see that BeX X2 has a behavior rule that uses State variables from 
three other BeXs. Accessing external state variables in other Behavior Modules provides a 
20 powerful and flexible and robust method of building complex, multi-tired management 
and analysis systems that can observe the behavior of large, complex systems. Using 
shared state variables a higher level BeX can detect anomalous or performance-specific 
conditions that are distributed across many applications. You should also note that the 
definition of "higher" and "lower" level is relative to the BeX's dependency relationships. 
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To use shared state variables three design conditions must be met: the state 
variables in the low level BeX must be declared as External, the same state variables must 
be defined as Shared in the higher level BeX, and the low level (or dependent) BeXs must 
appear in the higher level BeX's dependency (or topology) declaration. A BeX with 
shared. The following illustrates a Behavior Module with shared state variables and a rule 
that uses them. 



10 



15 



BeX ID : 
CREATED : 
PROCESS : 
DEPENDENT : 

// 

STATES { 

Shared Boolean 
Shared Boolean 
Shared Boolean 
Boolean 
Boolean 

} 



MyBeX 

10MAYOO 15:18:12 
C: \SVER01\SAP2\I\SHOES.EXE 
UrBeX, ThisBeX, ThatBeX, AnudderBeX 



QueueAlert 

That BeX (ResponseAlert) 

PagingAlert 

CPUAlert 

DISKAlert 



20 



25 



VARIABLES { 

int iCPUUsed 
int iDiskRem 

} 



RM . GETCPUTIME 
RM . GETDISKLEFT 



30 



METRIC RULES { 

If iCPUUsed > 80 then CPUAlert « TRUE; 
If iDiskRem < 20 then DISKAlert = TRUE; 



} 
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BEHAVIOR RULES { 



SysRuleOl. If CPUAlert and QueueAlert but not FagingAlert then 



Send Event (bAware , SysRuleOl , aCRITICAL, I NSUFF I C I ENT - RESOURCES , SAppName ) ; 



End if 



5 



An independent BeX can expose state variables in another BeX through an explicit 
declaration. This is illustrated in the declaration of the state variable ResponseAlert. By 
explicitly qualifying the variable name with the name of the BeX, we cause the target state 

10 variable to be prompted to a storage type of External. You can use this state just like any 
other state unless another shared state has the same name. In this case, the exposed state 
must be qualified (ThatBeX.ResponseAlert). 

Fig. 16 shows a mechanism in which BeXs may share information through a 
blackboard server. Each BeX may read or write information to the blackboard. 

15 A BeX may be used for special functions. For example, a specially coded BeXs 

called Adaptive- Support Behavior Modules (or ABeX) may be used for adaptive feedback 
control. Adaptive feedback control may be a top down process. Fig. 1 7 shows a general 
block diagram of adaptive feedback. System 1020 represents a collection of BeXs. A 
sensor array 1010 may observe and record how system 1020 reacts to different situations 

20 in service management. Such sensor data is sent to a tunerl040. Tuner 1040 is equipped 
with a set of objective functions that are related to expected system performance. If the 
recorded data about system 1020 does not match with the objective functions, tuner 1040 
initiates adaptive feedback control to tune system 1020. The tuning may be achieved by 
forcing the BeXs in system 1020 to revise the rules that are related to unsatisfactory 

25 performance. This process can be seen in more detail from Fig. 18. 
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In Fig. 18, system 1020 comprises, for example, three BeXs, 530a, 530b, and 530c, 
each of which is attached to an infrastructure component. In addition to these monitoring 
BeXs, a set of specially coded BeXs called adaptive-support BeXs or AbeXs, 1110a, 
1 1 10b, 1 1 10c, are used for adaptive feedback control. Each of ABeXs comprises inter- 
5 connected external (shared) state variables that can be accessed by sensor 1010. Data 
from sensor 1010 is evaluated by an evaluator 1030 against a set of objective functions. 
The objective functions may be multiple dimensioned. The evaluation may be performed 
by computing a set of Euclidean distance between the sensed states and the target states 
(specified by the objective functions). The distance is used to determine the adjustment to 
10 be made. Tuner 1040 sends adjustments back to the associated BeXs to update their 
internal states. 

The plant tuner is a fuzzy logic controller. The controller consists of fuzzy if-then 
rules (arranged in a connectionist architecture representing a state transition machine). 
Each ABeX contains a collection of fuzzy rules, which measure the performance of the 
15 system and report the degree of compatibility with the objective function (themselves 
organized as fuzzy numbers). Fuzzy rules employ variable window lad horizons so that 
changes in the system state can be accurately measured. Quantification of the objective 
function is achieved through several steps: 

20 • Centroid defuzzification of outcome space 



• Conversion of the defuzzified outcome to a fuzzy number with the appropriate 



expectancy interval 



Comparison with fuzzy objective using inverse of Euclidean distance as the 



similarity measurement control. 
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• Run fuzzy rule base with each similarity coefficient to determine how to adjust 
machine parameters. 

Nearly all the state variables in the ABeX systems are shared. These variables are 
identified by the leading underscore in their name (_CPU Alert). 
5 Sensor 1010, tuner 1040, and the evaluator 1040 may reside in a BeX where the 

adaptive feedback control is initiated. Fig. 19 shows an exemplary adaptive feedback 
control among a set of BeXs. In Fig. 19, BeXi 1210 is attached to an infrastructure 
component, for example, an application that computes the trend of a stock price. When 
the memory use of this particular application goes up to 35% on a local system, it may 

10 trigger a particular behavior rule. Since at component level, BeXi has no knowledge about 
the higher level business need for the capacity of the memory of this local system, it has 
no way to know what kind of impact this abnormal behavior will cause on the overall 
eService performance. So, the behavior rule associated with this stock price application 
may conservatively trigger an action to simply report this abnormal behavior to a higher 

15 level BeX. 

Based on the behavior rule of BeXi, this abnormal event is reported to an 
integration BeX 1220, located, for example, in local ecology pattern detector. The local 
integration BeX 1220 may still not have enough business process knowledge to estimate 
the severity of this particular abnormality with respect to the eService. So, it may further 

20 forward the event to a global integration BeX 1230, which may be located in a global 
ecology controller. Since BeX 1230 sits at the eService level, it is equipped with the 
knowledge about the business process of eService. Based on such knowledge and the 
reported events from all parts of the eService infrastructure (BeXs 1240, 1250, 1260, 
1270, and 1280), it may estimate or detect a significant performance degradation at 

25 eService level. By analyzing the reported abnormal events, BeX 1230 may decide that the 

-49- 




Panacya Ref: Bex: 




L & A Ref: 01-PAN-04 

major factor responsible for the overall performance degradation is the lack of memory 
space at the system where the stock price application is running. It may further identify 
that lack of memory is due to the fact (according to the event reported from BeXi 1210) 
that a particular application has used up a large chuck of memory on that system and 
5 caused shortage of the memory. In addition, it may recognize that 1210 and 1220 are the 
BeXs that are responsible for that particular application. 

The unexpected performance degradation and the identified cause may trigger BeX 
1230 to decide that adaptive feedback control is necessary. Since it is clear at this point 
that BeXi, who is directly responsible for the faulty application, and all the BeXs that 
10 simply routed the information about the abnormal behavior of the faulty application fail to 
realize the severity of the misbehavior, iBeX 1230 initiates adaptive tuning by sending an 
updated rule to both BeX 1220 and BeX 1210. The rule is to be used to replace the 
conservative behavior rules that are previously used by both BeXs 1210 and 1220 
regarding this particular behavior. 



single application exceeds 30%, then the application should be re-ranked with a much 
lower priority. It is also possible to simply instruct to kill such applications. The former 
strategy provides more space to conduct incremental learning. It is also possible for BeX 
1230 to initiate a feedback control by sending a generic behavior rule to all the BeXs 

20 (1210, 1220, 1240, 1250, 1260, 1270, 1280) that restricts the use of any application at any 
time instance to maximum of 20% of total memory capacity. 

Adaptive feedback control can be performed within different scopes. While the 
example shown in Fig. 14 is from the eService level all the way down to component level, 
it is also possible to initiate from local ecological level to component level or even among 

25 component level BeXs. It is flexible, dynamic, and learning based. It may be initiated 



15 



In the updated rule, it may explicitly indicate that if the memory usage of any 
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when an unexpected performance degradation is due to the misjudgment from BeXs due to 
inexperience. It may be initiated because of other reasons. With the capability of self- 
adapting, the entire eSerive management system 100 is capable of continuous evolving, 
during its operation and based on accumulated experience, towards an optimal 
performance state. 
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